Survey and Analysis of Production Distributed Computing Infrastructures

نویسندگان

  • Daniel S. Katz
  • Shantenu Jha
  • Manish Parashar
  • Omer F. Rana
  • Jon B. Weissman
چکیده

Context The material in this report is a draft of a large part of Chapter 3 of " Abstractions for Distributed Applications and Systems, " a book being written to be published by Wiley in 2012. This report primarily covers production distributed computing infras-tructures that have been used to develop and deploy large-scale scientific applications. We define a production distributed computing infrastructure as a set of computational hardware and software, in multiple locations, intended for use by multiple people who are not the developers of the infrastructure. We observe that typically the time scales over which scientific applications are developed and used is qualitatively larger than the time scales over which the underlying infrastructure tends to evolve. For instance, the middleware used and the services and interfaces offered by many distributed computing infrastructures have changed over recent years due to changes in providers and other technical, political, and funding reasons. Additionally, some of the commercial infrastructures themselves have developed relatively recently. However, one component of this landscape has essentially remained the same: scientific applications and the most commonly used methods used to develop them. The relatively slow evolution of scientific applications is both an opportunity and a challenge. It is a challenge in that once developed , they are hard to modify and adapt to changes in infrastructure. It is an opportunity in the sense that if we can design and architect scientific applications correctly they will be immune to shifts in the underlying infrastructures! Given the many changes in academic computing infrastructures the world over, and the fast evolution of commercial infrastructures, this report is an attempt to provide a topical and focused analysis of distributed computing infrastructures. The book from which this report has originated provides: (i) a critical assessment of a number of existing scientific applications and infrastructures – to identify gaps between application requirements and the abstractions and capabilities provided by the current generation of systems and infrastructure ; (ii) a survey of 13 application case studies; (iii) survey of coordination abstractions and infrastructures currently employed by distributed applications , in particular identifying mechanisms that may have benefit for future applications (in addition to those surveyed); and (iv) a survey and assessment of abstractions and infrastructures within the emerging area of data intensive applications. The book is, in part, a consequence of what we per-1 ceive to be a lack of sufficient connection between: (i) the …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-agent systems for protecting critical infrastructures: A survey

Multi-agent systems have emerged as a very significant platform in provisioning distributed and collaborative services to critical applications. Such applications require ubiquitous agent presence in the environment for monitoring, collecting data, communication, and subsequent data analysis, where the sensitivity of the application’s nature cannot be understated. Recent advances in the field o...

متن کامل

TASA: A New Task Scheduling Algorithm in Cloud Computing

Cloud computing refers to services that run in a distributed network and are accessible through common internet protocols. It merges a lot of physical resources and offers them to users as services according to service level agreement. Therefore, resource management alongside with task scheduling has direct influence on cloud networks’ performance and efficiency. Presenting a proper scheduling ...

متن کامل

Data Management in Production Grids - Challenges and Techniques

Advances in networking and distributed computing allowed the establishment of production Grid infrastructures during the past few years. Today, large-scale production Grid infrastructures such as EGEE in Europe, OSG in the US, and NAREGI in Japan are offering their services to many scientific and industrial applications, from domains as diverse as Astronomy, Biomedicine, Computational Chemistry...

متن کامل

Peer Production of Survivable Critical Infrastructures

The paper describes an approach to provisioning survivable critical communications and computation infrastructures by deploying radically distributed, peer-based systems for communication, computation, and data storage and retrieval. The paper focuses on system survivability as an approach to robustness, rather than on artifact-oriented hardness or impregnability. Based on a survey of experienc...

متن کامل

Optimization Task Scheduling Algorithm in Cloud Computing

Since software systems play an important role in applications more than ever, the security has become one of the most important indicators of softwares.Cloud computing refers to services that run in a distributed network and are accessible through common internet protocols. Presenting a proper scheduling method can lead to efficiency of resources by decreasing response time and costs. This rese...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1208.2649  شماره 

صفحات  -

تاریخ انتشار 2012